WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/56, 9/28, C12Q 1/40, CUD 
3/386 



Al 



(11) International Publication Number: 



WO 99/09183 



(43) International Publication Date: 25 February 1999 (25.02.99) 



(21) International Application Number: 



PCT/US98/16906 



(22) International Filing Date: 



14 August 1998 (14.08.98) 



(30) Priority Data: 

08/914,679 



19 August 1997 (19.08.97) 



US 



(71) Applicant: GENENCOR INTERNATIONAL, INC. [US/US]; 

4 Cambridge Place, 1870 South Winton Road, Rochester, 
NY 14618 (US). 

(72) Inventors: DAY, Anthony; 55 1 Monterey Boulevard, San 

Francisco, CA 94127 (US). SWANSON, Barbara; 465 
Burnett Avenue #7, San Francisco, CA 94131 (US), 

(74) Agent: STONE, Christopher, L.; Genencor International, Inc., 
925 Page Mill Road, Palo Alto, CA 94304-1013 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, 
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, 
TM, TR, TT, UA, UG, UZ, VN, YU, ZW, ARIPO patent 
(GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, 
LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, 
CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: MUTANT ^-AMYLASE COMPRISING MODIFICATION AT RESIDUES CORRESPONDING TO A210, H405 AND/OR 
T412 IN BACILLUS LICHENIFORMIS 



(57) Abstract 



Alpha-amylase enzymes are disclosed in which one or more of residues corresponding to A210, H405 and T412 in Bacillus 
licheniformis are mutated. The disclosed alpha-amylase enzymes show altered or improved stability and/or activity profiles. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te dTvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/09183 



PCT/US98/16906 



MUTANT a-AMYLASE COMPRISING MODIFICATION 
AT RESIDUES CORRESPONDING TO A210, H405 AND/OR T412 

IN BACILLUS LICHENIFORMIS 

FIELD OF THE INVENTION 

The present invention is directed to a-amylases having introduced therein 
mutations providing additional stability under certain conditions. It is specifically 
contemplated that the mutant will have altered performance characteristics such as altered 
stability and/or altered activity profiles. 

BACKGROUND OF THE INVENTION 

a-Amylases (a-1,4-glucan-4-glucanohydrolase, EC 3.2.1.1) hydrolyze internal a- 
1,4-glucosidic linkages in starch, largely at random, to produce smaller molecular weight 
malto-dextrins. a-Amylases are of considerable commercial value, being used in the initial 
stages (liquefaction) of starch processing; in alcohol production; as cleaning agents in 
detergent matrices; and in the textile industry for starch desizing. a-Amylases are 
produced by a wide variety of microorganisms including Bacillus and Aspergillus, with most 
commercial amylases being produced from bacterial sources such as Bacillus 
licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, or Bacillus stearothermophilus. 
In recent years, the preferred enzymes in commercial use have been those from Bacillus 
licheniformis because of their heat stability and performance under commercial operating 
conditions. 

In general, starch to fructose processing consists of four steps: liquefaction of 
granular starch, saccharification of the liquefied starch into dextrose, purification, and 
isomerization to fructose. The object of a starch liquefaction process is to convert a 
concentrated suspension of starch polymer granules into a solution of soluble shorter 
chain length dextrins of low viscosity. This step is essential for convenient handling with 
standard equipment and for efficient conversion to glucose or other sugars. To liquefy 
granular starch, it is necessary to gelatinize the granules by raising the temperature of the 
granular starch to over about 72°C. The heating process instantaneously disrupts the 
insoluble starch granules to produce a water soluble starch solution. The solubilized 
starch solution is then liquefied by a-amylase (EC 3.2.1.1.). 

A common enzymatic liquefaction process involves adjusting the pH of a granular 
starch slurry to between 6.0 and 6.5, the pH optimum of a-amylase derived from Bacillus 
licheniformis, with the addition of calcium hydroxide, sodium hydroxide or sodium 
carbonate. The addition of calcium hydroxide has the advantage of also providing calcium 
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ions which are known to stabilize the a-amylases against inactivation. Upon addition of a- 
amylases , the suspension is pumped through a steam jet to instantaneously raise the 
temperature to between 80-11 5°C. The starch is immediately gelatinized and, due to the 
presence of a-amylases, depolymerized through random hydrolysis of a(1-4) glycosidic 
bonds to a fluid mass which is easily pumped. 

In a second variation to the liquefaction process, a-amylase is added to the starch 
suspension, the suspension is held at a temperature of 80-1 00°C to partially hydrolyze the 
starch granules, and the partially hydrolyzed starch suspension is pumped through a jet at 
temperatures in excess of about 105°C to thoroughly gelatinize any remaining granular 
structure. After cooling the gelatinized starch, a second addition of a-amylase can be 
made to further hydrolyze the starch. 

A third variation of this process is called the dry milling process. In dry milling, 
whole grain is ground and combined with water. The germ is optionally removed by 
flotation separation or equivalent techniques. The resulting mixture, which contains 
starch, fiber, protein and other components of the grain, is liquefied using a-amylase. The 
general practice in the art is to undertake enzymatic liquefaction at a lower temperature 
when using the dry milling process. Generally, low temperature liquefaction is believed to 
be less efficient than high temperature liquefaction in converting starch to soluble dextrins. 

Typically, after gelatinization the starch solution is held at an elevated temperature 
in the presence of a-amylase until a DE of 10-20 is achieved, usually a period of 1-3 
hours. Dextrose equivalent (DE) is the industry standard for measuring the concentration 
of total reducing sugars, calculated as D-glucose on a dry weight basis. Unhydrolyzed 
granular starch has a DE of virtually zero, whereas the DE of D-glucose is defined as 100. 

The maximum temperature at which the starch solution containing a-amylase can 
be held depends upon the microbial source from which the enzyme was obtained and the 
molecular structure of the a-amylase molecule. a-Amylases produced by wild type strains 
of Bacillus subtilis or Bacillus amyloliquefaciens are typically used at temperatures no 
greater than about 90°C due to excessively rapid thermal inactivation above that 
temperature, whereas a-amylases produced by wild type strains of Bacillus licheniformis 
can be used at temperatures up to about 1 10°C. The presence of starch and calcium ion 
are known to stabilize a-amylases against inactivation. Nonetheless, a-amylases are used 
at pH values above 6 to protect against rapid inactivation. At low temperatures, a-amylase 
from Bacillus licheniformis is known to display hydrolyzing activity on starch substrate at 
pH values lower than 5. However, when the enzyme is used for starch hydrolysis at 
common jet temperatures, e.g., between 102°C and 109°C, the pH must be maintained 
above at least pH 5.7 to avoid excessively rapid inactivation. The pH requirement 
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unfortunately provides a narrow window of processing opportunity because pH values 
above 6.0 result in undesirable by-products, e.g., maltulose. Therefore, in reality, 
liquefaction pH is generally maintained between 5.9 and 6.0 to attain a satisfactory yield of 
hydrolyzed starch. 

Another problem relating to pH of liquefaction is the need to raise the pH of the 
starch suspension from about 4, the pH of a corn starch suspension as it comes from the 
wet milling stage, to 5.9-6.0. This pH adjustment requires the costly addition of acid 
neutralizing chemicals and also requires additional ion-exchange refining of the final starch 
conversion product to remove the chemical. Moreover, the next process step after 
liquefaction, typically saccharification of the liquefied starch into glucose with 
glucoamylase, requires a pH of 4-4.5; therefore, the pH must be adjusted down from 5.9- 
6.0 to 4-4.5; requiring additional chemical addition and refining steps. 

Subsequent to liquefaction, the processed starch is saccharified to glucose with 
glucoamylase. A problem with present processes occurs when residual starch is present 
in the saccharification mixture due to an incomplete liquefaction of the starch, e.g., 
inefficient amylose hydrolysis by amylase. Residual starch is highly resistant to 
glucoamylase hydrolysis. It represents a yield loss and interferes with downstream 
filtration of the syrups. 

Additionally, many a-amylases are known to require the addition of calcium ion for 
stability. This further increases the cost of liquefaction. 

In U.S. Patent No. 5,322,778, liquefaction between pH 4.0 and 6.0 was achieved 
by adding an antioxidant such as bisulfite or a salt thereof, ascorbic acid or a salt thereof, 
erythorbic acid, or phenolic antioxidants such as butylated hydroxyanisole, butylated 
hydroxytoluene, or a-tocopherol to the liquefaction slurry. According to this patent, sodium 
bisulfite must be added in a concentration of greater than 5mM. 

In U.S. Patent No. 5,180,669, liquefaction between a pH of 5.0 to 6.0 was achieved 
by the addition of carbonate ion in excess of the amount needed to buffer the solution to 
the ground starch slurry. Due to an increased pH effect which occurs with addition of 
carbonate ion, the slurry is generally neutralized by adding a source of hydrogen ion, for 
example, an inorganic acid such as hydrochloric acid or sulfuric acid. 

In PCT Publication No. WO 95/35382, a mutant a-amylase is described having 
improved oxidation stability and having changes at positions 104, 128, 187 and/or 188 in 
B. licheniformis a-amylase. 

In PCT Publication No. WO 96/23873, mutant a-amylases are described which 
have any of a number of mutations. 
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In PCT Publication No. WO 94/02597, a mutant a-amylase having improved 
oxidative stability is described wherein one or more methionines are replaced by any 
amino acid except cysteine or methionine. 

In PCT publication No. WO 94/18314, a mutant a-amylase having improved 
oxidative stability is described wherein one or more of the methionine, tryptophan, 
cysteine, histidine or tyrosine residues is replaced with a non-oxidizable amino acid. 

In PCT Publication No. WO 91/00353, the performance characteristics and 
problems associated with liquefaction with wild type Bacillus licheniformis a-amylase are 
approached by genetically engineering the a-amylase to include the specific substitutions 
Ala-1 1 1-Thr, His-133-Tyr and/or ThM49-lle. 

Studies using recombinant DNA techniques to explore which residues are 
important for the catalytic activity of amylases and/or to explore the effect of modifying 
certain amino acids within the active site of various amylases and glycosylases have been 
conducted by various researchers (Vihinen et al., J. Biochem ., Vol. 107, pp. 267-272 
(1990); Holm et al., Protein Engineering , Vol. 3, pp. 181-191 (1990); Takase et al., 
Biochemica et Biophvsica Acta , Vol. 1120, pp. 281-288 (1992); Matsui et al., FEBS 
Letters , Vol. 310, pp. 216-218 (1992); Matsui et al., Biochemistry . Vol. 33, pp. 451-458 
(1992); Sogaard et al., J. Biol. Chem ., Vol. 268, pp. 22480-22484 (1993); Sogaard et al., 
Carbohydrate Polymers . Vol. 21, pp. 137-146 (1993); Svensson, Plant Mol. Biol .. Vol. 25, 
pp. 141-157 (1994); Svensson et al., J. Biotech ., Vol. 29, pp. 1-37 (1993)). Researchers 
have also studied which residues are important for thermal stability (Suzuki et al., J. Biol. 
Chem . Vol. 264, pp. 18933-18938 (1989); Watanabe etal., Eur. J. Biochem .. Vol. 226, pp. 
277-283 (1994)); and one group has used such methods to introduce mutations at various 
histidine residues in a Bacillus licheniformis amylase, the rationale being that Bacillus 
licheniformis amylase which is known to be relatively thermostable when compared to 
other similar Bacillus amylases, has an excess of histidines and, therefore, it was 
suggested that replacing a histidine could affect the thermostability of the enzyme. This 
work resulted in the identification of stabilizing mutations at the histidine residue at the 
+133 position and the alanine residue at position +209 (Declerck et al., J. Biol . Chem .. Vol. 
265, pp. 15481-15488 (1990); FR 2 665 178-A1; Joyet et al., Bio/Technoloov . Vol. 10, pp. 
1579-1583 (1992)). 

Despite the advances made in the prior art, a need exists for an a-amylase which is 
more effective in commercial liquefaction processes but allowing activity at lower pH than 
currently practical. Additionally, a need exists for improved amylases having 
characteristics which makes them more effective under the conditions of detergent use. 
Because commercially available amylases are not acceptable under many conditions due 
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to stability problems, for example, the high alkalinity and oxidant (bleach) levels associated 
with detergents, or temperatures under which they operate, there is a need for an amylase 
having altered, and preferably increased, performance profiles under such conditions. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide an a-amyiase having altered 
performance profiles. 

It is a further object of the present invention to provide an a-amylase having 
improved stability at high temperature. 

Accordingly, the present invention provides an a-amylase having introduced therein 
a mutation comprising an addition, substitution or deletion at a residue corresponding to 
A210, H405 and/or T412 in Bacillus licheniformis a-amylase. In a particularly preferred 
embodiment of the invention, the a-amylase is derived from a bacterial or a fungal source 
and comprises a substitution corresponding to Bacillus licheniformis. Most preferably, the 
a-amylase is derived from Bacillus and the mutations correspond to A210T, H405D and/or 
T412A in Bacillus licheniformis. 

The invention further comprises nucleic acids encoding such mutant amylases, 
vectors comprising such nucleic acids, host cells transformed with such vectors and 
methods of expressing mutant a-amylases utilizing such host cells. 

The invention further comprises the use of the mutant a-amylases according to the 
invention to liquefy starch in the starch processing pathway to glucose or other starch 
derivatives, as an additive in detergents such as laundry and dishwashing detergents, as a 
baking aid and for desizing of textiles. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the DNA sequence of the gene for a-amylase from Bacillus 
licheniformis (NCIB 8061) (SEQ ID NO:1) and deduced amino acid sequence of the 
translation product (SEQ ID NO:2) as described by Gray et al., J. Bacteriology , Vol. 166, 
pp. 635-643 (1986). 

Figure 2 illustrates the amino acid sequence (SEQ ID NO:3) of the mature a- 
amylase enzyme from Bacillus licheniformis. 

Figure 3 illustrates an alignment of the primary structures of three Bacillus a- 
amylases. The Bacillus licheniformis a-amylase (Am-Lich) (SEQ ID NO:4) is described by 
Gray et al., J. Bacteriology , Vol. 166, pp. 635-643 (1986); the Bacillus amyloliquefaciens a- 
amylase (Am-Amylo) (SEQ ID NO:5) is described by Takkinen et al., J. Biol. Chem .. Vol. 
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258, pp. 1007-1013 (1983); and the Bacillus stearothermophilus a-amylase (Am-Stearo) 
(SEQ ID NO:6) is described by lhara et al., J. Biochem ., Vol. 98, pp. 95-103 (1985). 

Figure 4 illustrates plasmid pHP1 3 wherein Cm R refers to chloramphenicol 
resistance, Em R refers to erythromycin resistance and Rep pTA1060 refers to the origin of 
replication from plasmid pTA1060. 

Figure 5 illustrates the pBLapr plasmid wherein BL AA refers to Bacillus 
licheniformis a-amylase gene; aprE refers to the promoter and signal peptide encoding 
region of the aprE gene; AmpR refers to the ampicillin resistant gene from pBR322; and 
CAT refers to the chloramphenicol resistance gene from pC194. 

Figure 6 illustrates the pHP.BL plasmid carrying the gene for Bacillus licheniformis 
a-amylase. 

DETAILED DESCRIPTION 

"a-Amylase" means an enzymatic activity which cleaves or hydrolyzes the a(1- 
4)glycosidic bond, e.g., that in starch, amylopectin or amylose polymers. a-Amylase as 
used herein includes naturally occurring a-amylases as well as recombinant a-amylases. 
Preferred a-amylases in the present invention are those derived from Bacillus 
licheniformis, Bacillus amyloliquefaciens or Bacillus stearothermophilus, as well as fungal 
a-amylases such as those derived from Aspergillus (i.e., A. oryzae and A. niger). 

"Recombinant a-amylase" means an a-amylase in which the DNA sequence 
encoding the naturally occurring a-amylase is modified to produce a mutant DNA 
sequence which encodes the substitution, insertion or deletion of one or more amino acids 
in the a-amylase sequence compared to the naturally occurring a-amylase. 

"Expression vector" means a DNA construct comprising a DNA sequence which is 
operably linked to a suitable control sequence capable of effecting the expression of said 
DNA in a suitable host. Such control sequences may include a promoter to effect 
transcription, an optional operator sequence to control such transcription, a sequence 
encoding suitable mRNA ribosome-binding sites, and sequences which control termination 
of transcription and translation. A preferred promoter is the Bacillus subtilis aprE 
promoter. The vector may be a plasmid, a phage particle, or simply a potential genomic 
insert. Once transformed into a suitable host, the vector may replicate and function 
independently of the host genome, or may, in some instances, integrate into the genome 
itself. In the present specification, plasmid and vector are sometimes used 
interchangeably as the plasmid is the most commonly used form of vector at present. 
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However, the invention is intended to include such other forms of expression vectors 
which serve equivalent functions and which are, or become, known in the art. 

"Host strain" or "host cell" means a suitable host for an expression vector 
comprising DNA encoding the a-amylase according to the present invention. Host cells 
useful in the present invention are generally procaryotic or eucaryotic hosts, including any 
transformable microorganism in which the expression of a-amylase according to the 
present invention can be achieved. Specifically, host strains of the same species or genus 
from which the a-amylase is derived are suitable, such as a Bacillus strain. Preferably, an 
a-amylase negative Bacillus strain (genes deleted) and/or an a-amylase and protease 
deleted Bacillus strain (AamyE, Aapr, Anpr) is used. Host cells are transformed or 
transfected with vectors constructed using recombinant DNA techniques. Such 
transformed host cells are capable of either replicating vectors encoding the a-amylase 
and its variants (mutants) or expressing the desired a-amylase. 

"Liquefaction" or "liquefy" means a process by which starch is converted to shorter 
chain and less viscous dextrins. Generally, this process involves gelatinization of starch 
simultaneously with or followed by the addition of a-amylase. 

According to the present invention, a mutant a-amylase is provided that has 
introduced therein a substitution, addition or deletion at A210, H405 and/or T412. 
Deletion, addition or substitution of an amino acid as used herein refers to any 
modification of the amino acid sequence of the precursor a-amylase itself, but preferably 
refers to using genetic engineering to mutate a nucleic acid encoding the precursor a- 
amylase so as to encode the deleted, substituted or added residue in the expressed 
protein. The precursor a-amylases include naturally occurring a-amylases and 
recombinant a-amylases. Modification of the precursor DNA sequence which encodes the 
amino acid sequence of the precursor a-amylase can be by methods described herein and 
in commonly owned U.S. Patent Nos. 4,760,025 and 5,185,258, incorporated herein by 
reference. 

Also provided is a nucleic acid molecule (DNA) which encodes an amino acid 
sequence comprising the mutant a-amylase provided by the present invention, expression 
systems incorporating such DNA including vectors and phages, host cells transformed with 
such DNA, and anti-sense strands of DNA corresponding to the DNA molecule which 
encodes the amino acid sequence. Similarly, the present invention includes a method for 
producing a mutant a-amylase by expressing the DNA incorporated in an expression 
system which has been transformed into a host cell. The mutant a-amylase of the 
invention may be used in liquefaction of starch, as an ingredient in laundry detergents, 



WO 99/09183 



PCT/US98/16906 



~ 8 -- 

automatic dishwashing detergents, hard surface cleaning products, in food processing 
including baking applications, in textile processing including as a desize agent, or in any 
other application in which a-amylase activity is useful. 

The precursor a-amylase is produced by any source capable of producing a- 
amylase. Suitable sources of a-amylases are prokaryotic or eukaryotic organisms, 
including fungi, bacteria, plants or animals. Preferably, the precursor a-amylase is 
produced by a Bacillus: more preferably, by Bacillus licheniformis, Bacillus 
amyloliquefaciens or Bacillus stearothermophilus; most preferably, the precursor a- 
amylase is derived from Bacillus licheniformis. 

Homologies have been found between almost all endo-amylases sequenced to 
date, ranging from plants, mammals, and bacteria (Nakajima et al., Appl. Microbiol. 
Biotechnol., Vol. 23, pp. 355-360 (1986); Rogers, Biochem. Biophys. Res. Commun ., Vol. 
128, pp. 470-476 (1985); Janecek, Eur. J. Biochem ., Vol. 224, pp. 519-524 (1994)). There 
are four areas of particularly high homology in certain Bacillus amylases, as shown in 
Figure 3, wherein the underlined sections designate the areas of high homology. 
Sequence alignments have also been used to map the relationship between Bacillus 
endo-amylases (Feng et al., J. Molec. EvoL Vol. 35, pp. 351-360 (1987)). The relative 
sequence homology between Bacillus stearothermophilus and Bacillus licheniformis 
amylase is about 66% and that between Bacillus licheniformis and Bacillus 
amyloliquefaciens amylases is about 81%, as determined by Holm et al., Protein 
Engineering , Vol. 3, No. 3, pp. 181-191 (1990). While sequence homology is important, it 
is generally recognized that structural homology is also important in comparing amylases 
or other enzymes. For example, structural homology between fungal amylases and 
bacterial amylase has been suggested and, therefore, fungal amylases are encompassed 
within the present invention. 

Among others, addition, deletion or substitution at residues corresponding to A210, 
H405 and/or T412 in Bacillus licheniformis a-amylase are identified herein. Thus, specific 
residues such as A210 refer to an amino acid position number (i.e., +210) which 
references the number assigned to the mature Bacillus licheniformis a-amylase sequence 
illustrated in Figure 1 . The invention, however, is not limited to the mutation of the 
particular mature a-amylase of Bacillus licheniformis but extends to precursor a-amylases 
containing amino acid residues at positions which are equivalent to the particular identified 
residue in Bacillus licheniformis a-amylase. A residue of a precursor a-amylase is 
equivalent to a residue of Bacillus licheniformis a-amylase if it is either homologous (i.e., 
corresponds in position for either the primary or tertiary structure) or analogous to a 
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specific residue or portion of that residue in Bacillus licheniformis a-amylase (i.e., having 
the same or similar functional capacity to combine, react, or interact chemically or 
structurally). 

In order to establish homology to primary structure, the amino acid sequence of a 
precursor a-amylase is directly compared to the Bacillus licheniformis a-amylase primary 
sequence and particularly to a set of residues known to be invariant to all a-amylases for 
which sequences are known (see e.g., Figure 3). It is possible also to determine 
equivalent residues by tertiary structure analysis of the crystal structures reported for 
porcine pancreatic a-amylase (Buisson et al., EMBO Journal , Vol. 6, pp. 3909-3916 
(1987); Qian et al., Biochemistry , Vol. 33, pp. 6284-6294 (1994); Larson et al., J. Mol. 
Biol ., Vol. 235, pp. 1560-1584 (1994)); Taka-amylase A from Aspergillus oryzae (Matsuura 
et al., J. Biochem . (Tokyo), Vol. 95, pp. 697-702 (1984)); and an acid a-amylase from A. 
niger (Boel et al.. Biochemistry , Vol. 29, pp. 6244-6249 (1990)), with the former two 
structures being similar, and for barley a-amylase (Vallee et al., J. Mol. Biol ., Vol. 236, pp. 
368-371(1994); Kadziola, J. Mol. Biol ., Vol. 239, pp. 104-121 (1994)). Several preliminary 
studies have been published related to the secondary structure of a-amylase, i.e., (Suzuki 
et al., J. Biochem ., Vol. 108, pp. 379-381 (1990); Lee et al., Arch. Biochem. Biophvs , Vol. 
291, pp. 255-257 (1991); Chang et al., J. Mol. Biol ., Vol. 229, pp. 235-238 (1993); Mizuno 
et al., J. Mol. Biol ., Vol. 234, pp. 1282-1283 (1993)), and at least one structure has been 
published for crystalline Bacillus licheniformis a-amylase (Machius et al., J. Mol. Biol . Vol. 
246, pp. 545-549 (1995)), However, several researchers have predicted common super- 
secondary structures between glucanases (MacGregor et al., Biochem. J ., Vol. 259, pp. 
145-152 (1989)) and within a-amyiases and other starch-metabolising enzymes 
(Jaspersen, J. Prot. Chem . Vol. 12, pp. 791-805 (1993); MacGregor, Starke , Vol. 45, pp. 
232-237 (1993)); and sequence similarities between enzymes with similar super- 
secondary structures to a-amylases (Janecek, FEBS Letters , Vol. 316, pp. 23-26 (1993); 
Janecek et al., J. Prot. Chem ., Vol. 12, pp. 509-514 (1993)). A structure for the Bacillus 
stearothermophilus enzyme has been modeled on that of Taka-amylase A (Holm et al., 
Protein Engineering , Vol. 3, pp. 181-191 (1990)). The four highly conserved regions 
shown in Figure 3 contain many residues thought to be part of the active-site (Matsuura et 
al., J. Biochem . (Tokyo), Vol. 95, pp. 697-702 (1984); Buisson et al., EMBO Journal , Vol. 
6, pp. 3909-3916 (1987); Vihinen et al., J. Biochem ., Vol. 107, pp. 267-272 (1990)) 
including His +105; Arg +229; Asp +231; His +235; Glu +261 and Asp +328 under the 
Bacillus licheniformis numbering system. 
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a-Amylases according to the present invention which exhibit altered performance 
characteristics providing desirable and unexpected results are useful in the various 
applications for which a-amylases are commonly used. For example, a-amylases 
according to the present invention which exhibit altered performance characteristics at low 
pH, including improved thermostability, improved pH stability and/or improved oxidative 
stability, are useful in low pH liquefaction of starch. Enhanced thermostability will be 
useful in extending the shelf life of products which incorporate them. Enhanced oxidative 
stability or improved performance is particularly desirable in cleaning products, and for 
extending the shelf life of a-amylase in the presence of bleach, perborate, percarbonate or 
peracids used in such cleaning products. To the contrary, reduced thermal stability or 
oxidative stability may be useful in industrial processes which require the rapid and 
efficient quenching of amylolytic activity. 

a-Amylases of the present invention which exhibit improved low pH stability will be 
especially useful in starch processing and particularly in starch liquefaction. Conditions 
present during commercially desirable liquefaction processes characteristically include low 
pH, high temperature and potential oxidation conditions requiring a-amylases exhibiting 
improved low pH performance, improved thermal stability and improved oxidative stability. 
Accordingly, a-amylases according to the present invention which are particularly useful in 
liquefaction exhibit improved performance at a pH of less than about 6, preferably less 
than about 5.5, and most preferably less than about 5.0. Additionally, a-amylases 
according to the present invention which exhibit increased thermal stability at temperatures 
of between about 80-120°C, and preferably between about 100-1 10°C, and increased 
stability in the presence of oxidants will be particularly useful. 

Additional components known by those skilled in the art to be useful in liquefaction, 
including, for example, antioxidants, calcium, ions, salts or other enzymes such as 
endoglycosidases, cellulases, proteases, lipases or other amylase enzymes may be added 
depending on the intended reaction conditions. For example, combinations of the a- 
amylase according to the present invention with a-amylases from other sources may 
provide unique action profiles which find particular use under specific liquefaction 
conditions. In particular, it is contemplated that the combination of the a-amylase 
according to the present invention with a-amylase derived from Bacillus 
stearothermophilus will provide enhanced liquefaction at pH values below 5.5 due to 
complementary action patterns. 

During liquefaction, starch, specifically granular starch slurries from either a wet or 
dry milled process, is treated with an a-amylase of the present invention according to 
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known liquefaction techniques. Generally, in the first step of the starch degradation 
process, the starch slurry is gelatinized by heating at a relatively high temperature 
(between about 80 °C and about 1 10°C). After the starch slurry is gelatinized, it is 

liquefied using an a-amylase. 

In another embodiment of the present invention, detergent compositions in either 
liquid, gel or granular form, which comprise the a-amylase according to the present 
invention may be useful. Such detergent compositions will particularly benefit from the 
addition of an a-amylase according to the present invention which has increased thermal 
stability to improve shelf-life or increased oxidative stability such that the a-amylase has 
improved resistance to bleach or peracid compounds commonly present in detergents. 
Thus, a-amylase according to the present invention may be advantageously formulated 
into known powdered, liquid or gel detergents having a phi of between about 6.5 and 
about 12.0. Detergent compositions comprising the a-amylase according to the present 
invention may further include other enzymes such as endoglycosidases, cellulases, 
proteases, lipases or other amylase enzymes, particularly a-amylase derived from Bacillus 
stearothermophilus, as well as additional ingredients as generally known in the art. 

A preferred embodiment of the present invention further comprises, in addition to 
the substitution, addition or deletion of residues as provided herein, any one or more of the 
substitutions known in the art to confer stability or increased activity. For example, the 
deletion or substitution of a methionine residue or a tryptophan residue, for example M15, 
M 197 or W138 as described in WO 94/18314, the disclosure of which is incorporated by 
reference; substitution at H133Y as described in PCT Publication No. WO 91/00353; or 
substitution at A209 as described in DeClerck, et al., J. Biol. Chem ., Vol. 265, pp. 15481- 
15488 (1990); or any of the substitutions described in PCT Publication Nos. WO 
95/10603, WO 96/23873 and WO 96/23874. In particularly preferred embodiments, the a- 
amylase according to the present invention may further comprise a deletion or substitution 
at one or more residues corresponding to M15, A33, A52, S85, N96, V129, H133, S148, 
S187, N188, A209, A269 and/or A379 in Bacillus licheniformis. 

Embodiments of the present invention which comprise a combination of the a- 
amylase according to the present invention with protease enzymes preferably include 
oxidatively stable proteases such as those described in U.S. Re. 34,606, incorporated 
herein by reference, as well as commercially available enzymes such as DURAZYM (Novo 
Nordisk) and PURAFECT® OxP (Genencor International, Inc.). Methods for making such 
protease mutants (oxidatively stable proteases), and particularly such mutants having a 
substitution for the methionine at a position equivalent to M222 in Bacillus 
amyloliquefaciens, are described in U.S. Re. 34,606. 
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An additional embodiment of the present invention comprises DNA encoding an <x- 
amylase according to the present invention and expression vectors comprising such DNA. 
The DNA sequences may be expressed by operably linking them to an expression control 
sequence in an appropriate expression vector and employing that expression vector to 
transform an appropriate host according to well known techniques. A wide variety of 
host/expression vector combinations may be employed in expressing the DNA sequences 
of this invention. Useful expression vectors, for example, include segments of 
chromosomal, non-chromosomal and synthetic DNA sequences, such as the various 
known plasmids and phages useful for this purpose. In addition, any of a wide variety of 
expression control sequences are generally used in these vectors. For example, 
Applicants have discovered that a preferred expression control sequence for Bacillus 
transformants is the aprE signal peptide derived from Bacillus subtilis. 

A wide variety of host cells are also useful in expressing the DNA sequences of this 
invention. These hosts may include well known eukaryotic and prokaryotic hosts, such as 
strains of E. coli, Pseudomonas, Bacillus, Streptomyces, various fungi, yeast and animal 
cells. Preferably, the host expresses the a-amylase of the present invention extracellularly 
to facilitate purification and downstream processing. Expression and purification of the 
mutant a-amylase of the invention may be effected through art-recognized means for 
carrying out such processes. 

The improved a-amylases according to the present invention are contemplated to 
provide important advantages when compared to wild type Bacillus a-amylases. For 
example, one advantage is the increased activity found at low pH and high temperatures 
typical of common starch liquefaction methods. Other advantages may include increased 
high pH and oxidative stability which facilitates their use in detergents; more complete 
hydrolysis of starch molecules is achieved which reduces residual starch in the processing 
stream; improved stability in the absence of calcium ion; and that the addition of equal 
protein doses of a-amylase according to the invention may provide superior performance 
when compared to wild type Bacillus licheniformis a-amylase due to improvements in both 
specific activity and stability under stressed conditions. 

The following is presented by way of example and is not to be construed as a 
limitation to the scope of the claims. Abbreviations used herein, particularly three letter or 
one letter notations for amino acids are described in Dale, J.W., Molecular Genetics of 
Bacteria , John Wiley & Sons, (1989) Appendix B. 
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EXAMPLES 

EXAMPLE 1 
Construction Of Plasmid pHP.BL 

The a-amylase gene shown in Figure 1 was cloned from Bacillus licheniformis 

NCIB8061 (Gray et al., J. Bacteriology , Vol. 166, pp. 635-643 (1986)). The 1.72kb Pstl- 

Sstl fragment, encoding the last three residues of the signal sequence, the entire mature 

protein and the terminator region, was subcloned into M13mp18. A synthetic terminator 

was added between the Bell and Sstl sites using a synthetic oligonucleotide cassette of 

the form: 

5-GATCAAAACATAAAAAACCGGCCTTGGCCCCGCCGGTTTTTTATTATTTTTGAGCT-3' (SEQ ID NO:7) 
3'-TTTTGTATTTTTTGGCCGGAACCGGGGCGGCCAAAAAATAATAAAAAC-5' (SEQ ID NO:8) 

designed to contain the Bacillus amyloliquefaciens subtilisin transcriptional terminator 
(Wells et al., Nucleic Acid Research . Vol. 11, pp. 7911-7925 (1983)). 

The pBLapr plasmid was constructed carrying the gene for the Bacillus 

licheniformis a-amylase. As illustrated in Figure 5, pBLapr comprises a 6.1kb plasmid 
including the ampicillin resistance gene from pBR322 and the chloramphenicol resistance 
gene from pC194, the aprE promoter and the gene encoding for the Bacillus licheniformis 
a-amylase ("BL AA"). The aprE promoter was constructed from a 660bp Hindlll-Pstl 
fragment encoding for the promoter and signal sequence of the Bacillus subtilis alkaline 
protease. The Pstl site was removed, and an Sfil site added close to the aprEJBL AA 
junction. The BL AA gene comprises the 1720 bp Pstl-Sstl fragment described above. In 
the work described herein, pBLapr was constructed with an Sfil site adjacent to the 5' end 
of the start of the coding sequence for the mature amylase gene. Specifically, the 5' end 
of the pBLapr construction was subcloned on an EcoRI-Sstll fragment from pBLapr into 
M13BM20 (Boehringer Mannheim) to obtain a coding-strand template for the mutagenic 
oligonucleotide below: 

5'- CCC ATT AAG ATT GGC CGC CTG GGC C GA CAT GTT GCT GG - 3' (SEQ ID NO: 9) 

This primer introduced an Sfil site (indicated by underlining) which allowed correct forms to 
be screened for by the presence of this unique restriction site. Subcloning the EcoRI-Sstll 
fragment back into the pBLapr vector gave a version of the plasmid containing an Sfil site. 

Plasmid pHP13 (Haima et al., Mol. Gen. Genet ., Vol. 209, pp. 335-342 (1987)) 
(Figure 4) was digested with restriction enzymes EcoRI and Hindlll and the resulting vector 
purified on a polyacrymide gel and then eluted. Plasmid pBLapr was digested with Hindlll, 
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Asp718 and in a separate incubation with Asp718, EcoRI and gel purified. Two bands, 
Hindlll-Asp718 (1203 bp) and Asp718-EcoRI (1253 bp)were gel purified, eluted from the 
gel and ligated into the vector by a 3-way ligation, to give plasmid pHP.BL, the plasmid 
used in expression of the a-amylase (Figure 6). 

EXAMPLE 2 

Construction Of Plasmid Encoding a-Amylase 
Comprising A210T/H405A/T412D 

A pBLapr plasmid having threonine substituted for methionine at amino acid 15 

was constructed according to U.S. Patent Application Serial No. 08/194,664 (PCT 

Publication No. WO 94/18314). To introduce the mutations, the following mutagenic 

primers encoding for substitutions of A210T/M405D/T412A are used together with non- 

mutagenic primers to introduce the desired mutations into linear multiple tandem repeats 

of the plasmid by the method of multimerization as described below. 

H405D (L) 

(411) CCA GCC GAC AAT GTC ATG GTC GTC GAA ATA ATC (401 ) (SEQ ID NO: 10) 
A210T (R) 

(206) CCT GAT GTC GCA ACA GAA ATT AAG AGA TGG (215) (SEQ ID NO:1 1) 
T412AfU 

(416) GTC GCC TTC CCT TGC CCA GCC GAC AAT GTC (407) (SEQ ID NO:12) 

A fragment starting at the appropriate mutagenic primer for the desired mutation 
(shown above) and ending at the end of the non-mutagenic primer is generated by PCR. 
This fragment is gel purified and used to generate long, linear tandem repeats of the 
plasmid encoding the desired mutations as follows: 

The vector (pBLapr) is linearized by restriction digest (Sal I) and purified using 
Qiagen kits. The multimerization reactions typically contain 5.4 mM Tris buffer pH 8.0, 1x 
XL buffer (Perkin Elmer, Branchburg, NJ), 0.2 mM dNTPs, 1.1 mM Mg(OAc) 2 , 3 ng/^il 
incoming fragment, 0.15 ng/jxl linearized vector, 4 U rTth DNA polymerase, XL (Perkin 
Elmer) in 100 jxl reaction mixture. PCR reactions are typically performed in a thermocycler 
under the following conditions: 20 cycles (15s 94°C, 5 min 68°C) and 15 cycles (15s 94°C, 
10 min 68°C). 

The resulting multimers are transformed directly into B. subtilis competent cells 
using standard techniques. Plasmid DNA is isolated from the transformants using 
standard techniques. 

Mutations were confirmed by dideoxy sequencing (Sanger et al., Proc. Natl. Acad. 
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Sci. U.S.A .. Vol. 74, pp. 5463-5467 (1977)). 

EXAMPLE 3 

Transformation Of Plasmids Into Bacillus subtilis, 
Expression And Purification of Mutant a-Amvlase 

a-Amylase may be expressed in Bacillus subtilis after transformation with the 
plasmids described above. pHP13 is a plasmid able to replicate in E. coli and in Bacillus 
subtilis. Plasmids containing different variants were constructed using £. coli strain 
MM294, the plasmids isolated and then transformed into Bacillus subtilis as described in 
Anagnostopoulos et al., J. Bacter ., Vol. 81, pp. 741-746 (1961). The Bacillus strain had 
been deleted for two proteases (Aapr, Anpr) (see e.g., Ferrari et al., U.S. Patent No. 
5,264,366) and for amylase (AamyE) (see e.g., Stahl et al., J. Bacter ., Vol. 158, pp. 411- 
418 (1984)). After transformation, the sacU(Hy) mutation (Henner et al., J. Bacter ., Vol., 
170, pp. 296-300 (1988)) was introduced by PBS-1 mediated transduction (Hoch, J. Bact ., 
Vol. 154, pp. 1513-1515 (1983)). 

Secreted amylase was recovered from Bacillus subtilis cultures as follows: Sodium 
chloride was added to the culture supernatant to 20mM and the pH was adjusted to 
approximately 7.0 with 1M tris buffer, pH 7.2. The supernatant was then heated to 70°C 
for 15 minutes, and the precipitate removed by centrifugation. Ammonium sulphate was 
added the supernatant to 1.3M followed by 20ml phenyl sepharose fast flow 6 (high 
substitution) resin (Pharmacia). Following agitation, resin was separated by filtration, and 
washed in 1M ammonium sulphate, 20mM ammonium acetate pH 7.0, 5mM calcium 
chloride. The bound amylase was eluted into 20mM ammonium acetate pH 7.0, 5mM 
calcium chloride, and precipated by addition of ammonium sulphate to 70% saturation. 
The precipated material was pelleted by centrifugation, redissolved in a minimum volume 
of 20mM ammonium acetate pH 7.0, 5mM calcium chloride and dialysed against the same 
buffer. 

Concentration was determined using the soluble substrate assay, assuming wild- 
type specific activity. 

EXAMPLE 4 
Assay For Determining a-Amylase Activity 
Soluble Substrate Assay : A rate assay was developed based on an end-point 
assay kit supplied by Megazyme (Aust.) Pty. Ltd. A vial of substrate (p-nitrophenyl 
maltoheptaoside, BPNPG7) was dissolved in 10ml of sterile water followed by a 1:4 
dilution in assay buffer (50mM maleate buffer, pH 6.7, 5mM calcium chloride, 0.002% 
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Tween20). Assays were performed by adding 10|xl of amylase to 790|il of the substrate in 
a cuvette at 25°C. Rates of hydrolysis were measured as the rate of change of 
absorbance at 410nm, after a delay of 75 seconds. The assay was linear up to rates of 
0.2 absorption units/min. 

a-Amylase protein concentration was measured using the standard Bio-Rad Assay 
(Bio-Rad Laboratories) based on the method of Bradford, Anal. Biochem .. Vol. 72, p. 248 
(1976) using bovine serum albumin standards. 

EXAMPLE 5 

Preparation and Testing of Additional Mutant 
Alpha-Amvlases for Thermal Stability 

Mutant B. licheniformis alpha-amylase was prepared having substitutions at 
A210T/H405D/T412A. Thermal inactivation rate for the mutant was measured according 
to the following procedure. Amylase stock solutions were dialysed extensively into 20 mM 
ammonium acetate, 4 mM CaCI 2 pH 6.5. Each sample was stored at 4°C. For 
measurement of stability, this stock was diluted >50fold into 50mM ammonium acetate, 
5mM CaCI 2 , 0.02% Tween 20 pH 4.8 to a final concentration of between 30 and 50 jug/ml. 
Six 100|ul aliquots were put into eppendorf tubes and placed into a water bath or hot block 
at 83 ° C. The eppendorf tubes were removed at regular, measured intervals of between 
30 seconds and 5 minutes and placed on ice to stop the inactivation. The residual activity 
was assayed using a soluble substrate as described in Example 4. The natural log of the 
activity was plotted against time of incubation, and the rate constant for inactivation 
obtained from the slope of the straight line. Results are provided in Table 1 . 



TABLE 1 





Relative Half-Life 


Amylase 


Exp. #1 


Exp. #2 


wild type 


1.00 


1.00 


wild type 


1.01 


XX 


A210T/H405D/ 
T412A 


1.06 


1.05 



As shown in Table 1, mutant enzymes having introduced therein the mutations 
according to the invention have significantly improved stability under the conditions of the 
assay. 
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CLAIMS 

1 . A mutant a-amylase which is derived from a precursor a-amylase by 
the deletion, substitution or addition to said precursor a-amylase of a residue 
corresponding to A210, H405 and/or T412 in Bacillus licheniformis a-amylase. 

2. The mutant a-amylase according to claim 1 , wherein said mutation 
comprises deletion, substitution or addition at two or more of A210, H405 and/or 
T412 in Bacillus licheniformis a-amylase. 

3. The a-amylase according to claim 2, wherein said a-amylase 
comprises a substitution corresponding to A210T/H405D/T412A in Bacillus 
licheniformis a-amylase. 

4. The a-amylase according to claim 1, wherein said a-amylase is 
derived from a bacterial or fungal source. 

5. The a-amylase according to claim 1, wherein said a-amylase is 
derived from Bacillus. 

6. The a-amylase according to claim 5, wherein said a-amylase is 
derived from Bacillus licheniformis, Bacillus stearothermophilus or Bacillus 
amyloliquefaciens. 

7. The a-amylase according to claim 1 wherein said a-amylase further 
comprises the deletion or substitution of a residue corresponding to M15, A33, A52, 
S85, N96, V129, H133, S148N, S187, N188, A209, A269 and/or A379 in Bacillus 
licheniformis a-amylase. 

8. The a-amylase according to claim 1, wherein substitution further 
comprises substituting or deleting a residue corresponding to M15T, W138Y and/or 
M197T in Bacillus licheniformis. 
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9. A DNA encoding the a-amylase according to claim 1. 

10. An expression vector comprising the DNA of claim 9. 

11. A host cell transformed with the expression vector of claim 10. 

12. An a-amylase according to claims 1 , 3 or 7 having enhanced low pH 
performance and/or increased thermostability. 

13. A detergent composition comprising the a-amylase according to 

claim 1. 

14. The detergent composition according to claim 13, wherein said 
detergent is useful for cleaning soiled laundry and/or soiled dishes. 

15. A method of liquefying starch comprising contacting a slurry of starch 

with the a-amylase according to claim 1. 
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10 30 50 

AGCTT 6AAGAA6T G AAG AAGC AG AG AGGC T AT T GAATAAAT G AGT AG AAAGC GC C AT AT C 

70 90 110 

GGCGCTTTTCTTTTGGAAGAAAATATAGGGAAAATGGTACTTGTTAAAAATTCGGAATAT 

130 150 170 

TTATACAACATCATATGTTTCACATTGAAAGGGGAGGAGAATCATG2LAACAACAAAAACG 

M K Q Q K R 

190 210 230 

GCTTTACGCCCGATTGCTGACGCTGTTATTTGCGCTCATCTTCTTGCTGCCTCATTCTGC 
LYARLIiTLLFALIFLIiPHSA 

250 270 290 

AGCAGCGGCGGCAAATCTTAATGGGACGCTGATGCAGTATTTTGAATGGTACATGCCCAA 
AAAANIiNGTLMQYFEWYMPN 

310 330 350 

TGACGGCCAACATTGGAAGCGTTTGCAAAACGACTCGGCATATTTGGCTGAACACGGTAT 
DGQHWKRLQNDSAYLAEHGI 

370 390 410 

TACTGCCGTCTGGATTCCCCCGGCATATAAGGGAACGAGCCAAGCGGATGTGGGCTACGG 
TAVWI PPAYKGTSQADVGYG 

430 450 470 

TGCTTACGACCTTTATGATTTAGGGGAGTTTCATCAAAAAGGGACGGTTCGGACAAAGTA 
AYDLYDLGEFHQKGTVRTKY 

490 510 530 

CGGCACAAAAGGAGAGCTGCAATCTGCGATCAAAAGTCTTCATTCCCGCGACATTAACGT 
GTKGELQSAIKSLHSRDINV 

550 570 590 

TTACGGGGATGTGGTCATCAACCACAAAGGCGGCGCTGATGCGACCGAAGATGTAACCGC 
YGDVVINHKGGADATEDVTA 

610 630 650 

GGTTGAAGTCGATCCCGCTGACCGCAACCGCGTAATTTCAGGAGAACACCTAATTAAAGC 
VEVDPADRNRVI SGEHLIKA 

670 690 710 

CTGGACACATTTTCATTTTCCGGGGCGCGGCAGCACATACAGCGATTTTAAATGGCATTG 
WTHFHFPGRGSTYSDFKWHW 

730 750 770 

GTACCATTTTGACGGAACCGATTGGGACGAGTCCCGAAAGCTGAACCGCATCTATAAGTT 
YHFDGTDWDESRKLNRIYKF 

790 810 830 

TCAAGGAAAGGCTTGGGATTGGGAAGTTTCCAATGAAAACGGCAACTATGATTATTTGAT 
QGKAWDWEVSNENGNYDYLM 

FIG.. 1A 

SUBSTITUTE SHEET (RULE 26) 
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850 870 890 

GTATGCCGACATCGATTATGACCATCCTGATGTCGCAGCAGAAATTAAGAGATGGGGCAC 
YADIDYDHPDVAAEIKRWGT 

910 930 950 

TTGGTATGCCAATGAACTGCAATTGGACGGTTTCCGTCTTGATGCTGTCAAACACATTAA 
WYANELQLDGFRLDAVKHIK 

970 990 1010 

ATTTTCTTTTTTGCGGGATTGGGTTAATCATGTCAGGGAAAAAACGGGGAAGGAAATGTT 
FSFLRDWVNHVREKTGKEMF 

1030 1050 1070 

TACGGTAGCTGAATATTGGCAGAATGACTTGGGCGCGCTGGAAAACTATTTGAACAAAAC 
TVAEYWQNDLGALENYLNKT 

1090 1110 1130 

AAATTTTAATCATTCAGTGTTTGACGTGCCGCTTCATTATCAGTTCCATGCTGCATCGAC 
NFNHSVFDVPLHYQFHAAST 

1150 1170 1190 

ACAGGGAGGCGGCTATGATATGAGGAAATTGCTGAACGGTACGGTCGTTTCCAAGCATCC 
QGGGYDMRKLLNGTVVSKHP 

1210 1230 1250 

GTTGAAATCGGTTACATTTGTCGATAACCATGATACACAGCCGGGGCAATCGCTTGAGTC 
LKSVTFV DNHDTQPGQSLES 

1270 1290 1310 

GACTGTCCAAACATGGTTTAAGCCGCTTGCTTACGCTTTTATTCTCACAAGGGAATCTGG 
TVQTWFKPLAYAFILTRESG 

1330 1350 1370 

ATACCCTCAGGTTTTCTACGGGGATATGTACGGGACGAAAGGAGACTCCCAGCGCGAAAT 
YPQVFYGDMYGTKGDSQREI 

1390 1410 1430 

TCCTGCCTTGAAACACAAAATTGAACCGATCTTAAAAGCGAGAAAACAGTATGCGTACGG 
PALKHKIEPILKARKQYAYG 

1450 1470 1490 

AGCACAGCATGATTATTTCGACCACCATGACATTGTCGGCTGGACAAGGGAAGGCGACAG 
AQHDYFDHHDIVGWTREGDS 

1510 1530 1550 

CTCGGTTGCAAATTCAGGTTTGGCGGCATTAATAACAGACGGACCCGGTGGGGCAAAGCG 
SVANSGLAALITDGPGGAKR 

1570 1590 1610 

AATGTATGTCGGCCGGCAAAACGCCGGTGAGACATGGCATGACATTACCGGAAACCGTTC 
MYVGRQNAGETWHDITGNRS 

1630 1650 1670 

GGAGCCGGTTGTCATCAATTCGGAAGGCTGGGGAGAGTTTCACGTAAACGGCGGGTCGGT 
EPVVINSEGWGEFHVNGGSV 

F/G._ IB 

SUBSTITUTE SHEET (RULE 26) 
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1690 1710 1730 

TTCAATTTATGTTCAAAGATAGAAGAGCAGAGAGGACGGATTTCCTGAAGGAAATCCGTT 
S I Y V Q R * 

1750 1770 1790 

TTTTTATTTTGCCCGTCTTATAAATTTCTTTGATTACATTTTATAATTAATTTTAACAAA 

1810 1830 1850 

GTGTCATCAGCCCTCAGGAAGGACTTGCTGACAGTTTGAATCGCATAGGTAAGGCGGGGA 

1870 1890 1910 

TGAAATGGCAACGTTATCTGATGTAGCAAAGAAAGCAAATGTGTCGAAAATGACGGTATC 

1930 1950 
GCGGGT GAT C AATC AT C C T G AGACTGTGAC GG AT GAATTGAAAAAGC T 



FIG..1C 
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10 30 50 

ANLNGTLMQYFEWYMPNDGQHWKRLQNDSAYLAEHGITAVWIPPAYKGTSQADVGYGAYD 

70 90 HO 

LYDLGEFHQKGTVRTKYGTKGELQSAIKSLHSRDINVYGDWINHKGGADATEDVTAVEV 

130 150 170 

DPADRNRVISGEHLIKAWTHFHFPGRGSTYSDFKWHWYHFDGTDWDESRKLNRIYKFQGK 

190 210 230 

AWDWEVSNENGNYDYLMYADIDYDHPDVAAEIKRWGTWYANELQLDGFRLDAVKHIKFSF 

250 270 290 

LRDWVNHVREKTGKEMFTVAEYWQNDLGALENYLNKTNFNHSVFDVPLHYQFHAASTQGG 

310 330 350 

GYDMRKLLNGTWSKHPLKSVTFVDNHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQ 

370 390 410 

VFYGDMYGTKGDSQREIPALKHKIEPILKARKQYAYGAQHDYFDHHDIVGWTREGDSSVA 

430 450 470 

NSGLAALITDGPGGAKRMYVGRQNAGETWHDITGNRSEPWINSEGWGEFHVNGGSVSIY 



VQR 

FIG...2 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Anthony G . Day 

Barbara A. Swanson 

(ii) TITLE OF INVENTION: MUTANT ALPHA-AMYLASE COMPRISING MODIFICATION 

AT RESIDUES CORRESPONDING TO A210, H405 

AND /OR 

T412 IN BACILLUS LICHENIFORMIS 
(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDING ADDRESS: 

(A) ADDRESSEE: Genencor International, Inc. 

(B) STREET: 92 5 Page Mill Road 

(C) CITY: Palo Alto 

( D ) STATE : CA 

( E ) COUNTRY : USA 

(F) ZIP: 94304-1013 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: To Be Assigned 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Stone, Christopher 

(B) REGISTRATION NUMBER: 35,696 

(C) REFERENCE /DOCKET NUMBER: GC387 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (650) 846-7555 

(B) TELEFAX: (650) 845-6504 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AGCTTGAAGA AGTGAAGAAG CAGAGAGGCT AT TG AAT AAA TGAGTAGAAA GCGCCATATC 60 



GGCGCTTTTC TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT 120 
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TTATACAACA TCATATGTTT CACATTGAAA GGGGAGGAGA ATCATGAAAC AACAAAAACG 180 

GCTTTACGCC CGATTGCTGA CGCTGTTATT TGCGCTCATC TTCTTGCTGC CTCATTCTGC 240 

AGCAGCGGCG GCAAATCTTA ATGGGACGCT GATGCAGTAT TTTGAATGGT ACATGCCCAA 300 

TGACGGCCAA CATTGGAAGC GTTTGCAAAA CGACTCGGCA TATTTGGCTG AACACGGTAT 360 

TACTGCCGTC TGGATTCCCC CGGCATATAA GGGAACGAGC CAAGCGGATG TGGGCTACGG 42 0 

TGCTTACGAC CTTTATGATT TAGGGGAGTT TCATCAAAAA GGGACGGTTC GGACAAAGTA 480 

CGGCACAAAA GGAGAGCTGC AATCTGCGAT CAAAAGTCTT CATTCCCGCG A CAT T AACGT 540 

TTACGGGGAT GTGGTCATCA ACCACAAAGG CGGCGCTGAT GCGACCGAAG ATGTAACCGC 600 

GGTTGAAGTC GATCCCGCTG ACCGCAACCG CGTAATTTCA GGAGAACACC TAATTAAAGC 660 

CTGGACACAT TTTCATTTTC CGGGGCGCGG C AG CAC AT AC AGCGATTTTA AATGGCATTG 720 

GTACCATTTT GACGGAACCG ATTGGGACGA GTCCCGAAAG CTGAACCGCA TCTATAAGTT 780 

TCAAGGAAAG GCTTGGGATT GGGAAGTTTC CAATGAAAAC GGCAACTATG ATTATTTGAT 840 

GTATGCCGAC ATCGATTATG ACCATCCTGA TGTCGCAGCA GAAATTAAGA GATGGGGCAC 900 

TTGGTATGCC AATGAACTGC AATTGGACGG TTTCCGTCTT GATGCTGTCA AACACATTAA 960 

ATTTTCTTTT TTGCGGGATT GGGTTAATCA TGTCAGGGAA AAAACGGGGA AGGAAATGTT 1020 

TACGGTAGCT G AAT AT TGG C AGAATGACTT GGGCGCGCTG GAAAACTATT TGAACAAAAC 1080 

AAATTTTAAT CAT T C AG TG T TTGACGTGCC GCTTCATTAT CAGTTCCATG CTGCATCGAC 1140 

ACAGGGAGGC GGCTATGATA TGAGGAAATT GCTGAACGGT ACGGTCGTTT CCAAGCATCC 1200 

GTTGAAATCG GTTACATTTG TCGATAACCA TGATACACAG CCGGGGCAAT CGCTTGAGTC 12 60 

GACTGTCCAA ACATGGTTTA AGCCGCTTGC TTACGCTTTT ATTCTCACAA GGGAATCTGG 1320 

ATACCCTCAG GTTTTCTACG GGGATATGTA CGGGACGAAA GGAGACTCCC AGCGCGAAAT 1380 

TCCTGCCTTG AAACACAAAA TTGAACCGAT CTTAAAAGCG AGAAAACAGT ATGCGTACGG 1440 

AGCACAGCAT GATTATTTCG ACCACCATGA CATTGTCGGC TGGACAAGGG AAGGCGACAG 1500 

CTCGGTTGCA AATTCAGGTT TGGCGGCATT AATAACAGAC GGACCCGGTG GGGCAAAGCG 1560 

AATGTATGTC GGCCGGCAAA ACGCCGGTGA GACATGGCAT GACATTACCG GAAACCGTTC 1620 

GGAGCCGGTT G T CAT C AAT T CGGAAGGCTG GGGAGAGTTT CACGTAAACG GCGGGTCGGT 1680 

TTCAATTTAT GTTCAAAGAT AG AAG AG C AG AGAGGACGGA TTTCCTGAAG GAAATCCGTT 1740 

TTTTTATTTT GCCCGTCTTA TAAATTTCTT TGATTACATT TTATAATTAA TTTTAACAAA 1800 

GTGTCATCAG CCCTCAGGAA GGACTTGCTG ACAGTTTGAA TCG CATAGGT AAGGCGGGGA 1860 
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TGAAATGGCA ACGTTATCTG ATGTAGCAAA GAAAGCAAAT GTGTCGAAAA TGACGGTATC 1920 
GCGGGTGATC AATCATCCTG AGACTGTGAC GGATGAATTG AAAAAGCT 1968 



(2) INFORMATION FOR SEQ ID NO:2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DE 

Met Lys Gin Gin 
1 

Ala Leu lie Phe 

20 

Asn Gly Thr Leu 
35 

His Trp Lys Arg 
50 



CRIPTION: SEQ I 

Lys Arg Leu Tyr 
5 

Leu Leu Pro His 



Met Gin Tyr Phe 

40 

Leu Gin Asn Asp 
55 



NO: 2 : 

Ala Arg Leu Leu 
10 

Ser Ala Ala Ala 
25 

Glu Trp Tyr Met 



Ser Ala Tyr Leu 

60 



Thr Leu Leu Phe 
15 

Ala Ala Asn Leu 
30 

Pro Asn Asp Gly 
45 

Ala Glu His Gly 



lie Thr Ala Val 
65 

Asp Val Gly Tyr 



Gin Lys Gly Thr 

100 

Ser Ala lie Lys 
115 



Trp lie Pro Pro 
70 

Gly Ala Tyr Asp 
85 

Val Arg Thr Lys 



Ser Leu His Ser 

120 



Ala Tyr Lys Gly 
75 

Leu Tyr Asp Leu 
90 

Tyr Gly Thr Lys 
105 

Arg Asp lie Asn 



Thr Ser Gin Ala 

80 

Gly Glu Phe His 
95 

Gly Glu Leu Gin 
110 

Val Tyr Gly Asp 
125 



Val Val lie Asn 
130 

Ala Val Glu Val 
145 

His Leu lie Lys 



Thr Tyr Ser Asp 

180 

Trp Asp Glu Ser 
195 



His Lys Gly Gly 
135 

Asp Pro Ala Asp 
150 

Ala Trp Thr His 
165 

Phe Lys Trp His 



Arg Lys Leu Asn 

200 



Ala Asp Ala Thr 

140 

Arg Asn Arg Val 
155 

Phe His Phe Pro 
170 

Trp Tyr His Phe 
185 

Arg lie Tyr Lys 



Glu Asp Val Thr 



lie Ser Gly Glu 

160 

Gly Arg Gly Ser 
175 

Asp Gly Thr Asp 
190 

Phe Gin Gly Lys 
205 
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Ala Trp Asp Trp 
210 

Met Tyr Ala Asp 
225 

Lys Arg Trp Gly 



Arg Leu Asp Ala 

260 

Val Asn His Val 
275 

Glu Tyr Trp Gin 
290 

Thr Asn Phe Asn 
305 

His Ala Ala Ser 



Asn Gly Thr Val 

340 

Asp Asn His Asp 
355 

Thr Trp Phe Lys 
370 

Gly Tyr Pro Gin 
385 

Ser Gin Arg Glu 



Lys Ala Arg Lys 

420 

His His Asp lie 
435 

Asn Ser Gly Leu 
450 

Arg Met Tyr Val 
465 

Thr Gly Asn Arg 



Glu Val Ser Asn 
215 

lie Asp Tyr Asp 
230 

Thr Trp Tyr Ala 
245 

Val Lys His lie 



Arg Glu Lys Thr 

280 

Asn Asp Leu Gly 
295 

His Ser Val Phe 
310 

Thr Gin Gly Gly 
325 

Val Ser Lys His 



Thr Gin Pro Gly 

360 

Pro Leu Ala Tyr 
375 

Val Phe Tyr Gly 
390 

lie Pro Ala Leu 
405 

Gin Tyr Ala Tyr 



Val Gly Trp Thr 

440 

Ala Ala Leu lie 
455 

Gly Arg Gin Asn 
470 

Ser Glu Pro Val 
485 



Glu Asn Gly Asn 

220 

His Pro Asp Val 
235 

Asn Glu Leu Gin 
250 

Lys Phe Ser Phe 
265 

Gly Lys Glu Met 



Ala Leu Glu Asn 

300 

Asp Val Pro Leu 
315 

Gly Tyr Asp Met 
330 

Pro Leu Lys Ser 
345 

Gin Ser Leu Glu 



Ala Phe lie Leu 

380 

Asp Met Tyr Gly 
395 

Lys His Lys lie 
410 

Gly Ala Gin His 
425 

Arg Glu Gly Asp 



Thr Asp Gly Pro 

460 

Ala Gly Glu Thr 
475 

Val lie Asn Ser 
490 



Tyr Asp Tyr Leu 



Ala Ala Glu lie 

240 

Leu Asp Gly Phe 
255 

Leu Arg Asp Trp 
270 

Phe Thr Val Ala 
285 

Tyr Leu Asn Lys 



His Tyr Gin Phe 

320 

Arg Lys Leu Leu 
335 

Val Thr Phe Val 
350 

Ser Thr Val Gin 
365 

Thr Arg Glu Ser 



Thr Lys Gly Asp 

400 

Glu Pro lie Leu 
415 

Asp Tyr Phe Asp 
430 

Ser Ser Val Ala 
445 

Gly Gly Ala Lys 



Trp His Asp lie 

480 

Glu Gly Trp Gly 
495 



Glu Phe His Val Asn Gly Gly Ser Val Ser lie Tyr Val Gin Arg 

500 505 510 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro 
15 10 15 

Asn Asp Gly Gin His Trp Lys Arg Leu Gin Asn Asp Ser Ala Tyr Leu 

20 25 30 

Ala Glu His Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly 
35 40 45 

Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu 
50 55 60 

Gly Glu Phe His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 
65 70 75 80 

Gly Glu Leu Gin Ser Ala lie Lys Ser Leu His Ser Arg Asp lie Asn 

85 90 95 

Val Tyr Gly Asp Val Val lie Asn His Lys Gly Gly Ala Asp Ala Thr 

100 105 110 

Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val 
115 120 125 

lie Ser Gly Glu His Leu lie Lys Ala Trp Thr His Phe His Phe Pro 
130 135 140 

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 
145 150 155 160 

Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg lie Tyr Lys 

165 170 175 

Phe Gin Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn 

180 185 190 

Tyr Asp Tyr Leu Met Tyr Ala Asp lie Asp Tyr Asp His Pro Asp Val 
195 200 205 

Ala Ala Glu lie Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin 
210 215 220 

Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe 
225 230 235 240 



Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met 
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Phe Thr Val Ala 

260 

Tyr Leu Asn Lys 
275 

His Tyr Gin Phe 
290 

Arg Lys Leu Leu 
305 

Val Thr Phe Val 



Ser Thr Val Gin 

340 

Thr Arg Glu Ser 
355 

Thr Lys Gly Asp 
370 

Glu Pro lie Leu 
385 

Asp Tyr Phe Asp 



Ser Ser Val Ala 

420 

Gly Gly Ala Lys 
435 

Trp His Asp lie 
450 

Glu Gly Trp Gly 
465 



245 

Glu Tyr Trp Gin 



Thr Asn Phe Asn 

280 

His Ala Ala Ser 
295 

Asn Gly Thr Val 
310 

Asp Asn His Asp 
325 

Thr Trp Phe Lys 



Gly Tyr Pro Gin 

360 

Ser Gin Arg Glu 
375 

Lys Ala Arg Lys 
390 

His His Asp lie 
405 

Asn Ser Gly Leu 



Arg Met Tyr Val 

440 

Thr Gly Asn Arg 
455 

Glu Phe His Val 
470 



250 

Asn Asp Leu Gly 
265 

His Ser Val Phe 



Thr Gin Gly Gly 

300 

Val Ser Lys His 
315 

Thr Gin Pro Gly 
330 

Pro Leu Ala Tyr 
345 

Val Phe Tyr Gly 



lie Pro Ala Leu 

380 

Gin Tyr Ala Tyr 
395 

Val Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Arg Gin Asn 



Ser Glu Pro Val 

460 

Asn Gly Gly Ser 
475 



255 

Ala Leu Glu Asn 
270 

Asp Val Pro Leu 
285 

Gly Tyr Asp Met 



Pro Leu Lys Ser 

320 

Gin Ser Leu Glu 
335 

Ala Phe lie Leu 
350 

Asp Met Tyr Gly 
365 

Lys His Lys lie 



Gly Ala Gin His 

400 

Arg Glu Gly Asp 
415 

Thr Asp Gly Pro 
430 

Ala Gly Glu Thr 
445 

Val lie Asn Ser 



Val Ser lie Tyr 

480 



Val Gin Arg 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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Met Lys Gin Gin 
1 

Ala Leu lie Phe 

20 

Asn Gly Thr Leu 
35 

His Trp Lys Arg 
50 

lie Thr Ala Val 
65 

Asp Val Gly Tyr 



Gin Lys Gly Thr 

100 

Ser Ala lie Lys 
115 

Val Val lie Asn 
130 

Ala Val Glu Val 
145 

His Leu lie Lys 

165 

Thr Tyr Ser Asp 
180 

Trp Asp Glu Ser 
195 

Ala Trp Asp Trp 
210 

Met Tyr Ala Asp 



Lys Arg Trp Gly 

245 

Arg Leu Asp Ala 
260 

Val Asn His Val 
275 

Glu Tyr Trp Gin 
290 

Thr Asn Phe Asn 



Lys Arg Leu Tyr 
5 

Leu Leu Pro His 



Met Gin Tyr Phe 

40 

Leu Gin Asn Asp 
55 

Trp lie Pro Pro 
70 

Gly Ala Tyr Asp 
85 

Val Arg Thr Lys 



Ser Leu His Ser 

120 

His Lys Gly Gly 
135 

Asp Pro Ala Asp 
150 

Ala Trp Thr His 



Phe Lys Trp His 

185 

Arg Lys Leu Asn 
200 

Glu Val Ser Asn 
215 

lie Asp Tyr Asp 
230 

Thr Trp Tyr Ala 

Val Lys His lie 

265 

Arg Glu Lys Thr 
280 

Asn Asp Leu Gly 
295 

His Ser Val Phe 



Ala Arg Leu Leu 
10 

Ser Ala Ala Ala 
25 

Glu Trp Tyr Met 



Ser Ala Tyr Leu 

60 

Ala Tyr Lys Gly 
75 

Leu Tyr Asp Leu 
90 

Tyr Gly Thr Lys 
105 

Arg Asp lie Asn 



Ala Asp Ala Thr 

140 

Arg Asn Arg Val 
155 

Phe His Phe Pro 
170 

Trp Tyr His Phe 



Arg lie Tyr Lys 

205 

Glu Asn Gly Asn 
220 

His Pro Asp Val 
235 

Asn Glu Leu Gin 
250 

Lys Phe Ser Phe 

Gly Lys Glu Met 

285 

Ala Leu Glu Asn 
300 

Asp Val Pro Leu 



Thr Leu Leu Phe 
15 

Ala Ala Asn Leu 
30 

Pro Asn Asp Gly 
45 

Ala Glu His Gly 



Thr Ser Gin Ala 

80 

Gly Glu Phe His 
95 

Gly Glu Leu Gin 
110 

Val Tyr Gly Asp 
125 

Glu Asp Val Thr 



lie Ser Gly Glu 

160 

Gly Arg Gly Ser 
175 

Asp Gly Thr Asp 
190 

Phe Gin Gly Lys 



Tyr Asp Tyr Leu 

225 

Ala Ala Glu lie 
240 

Leu Asp Gly Phe 
255 

Leu Arg Asp Trp 
270 

Phe Thr Val Ala 



Tyr Leu Asn Lys 

305 

His Tyr Gin Phe 
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His Ala Ala Ser 

325 

Asn Gly Thr Val 
340 

Asp Asn His Asp 
355 

Thr Trp Phe Lys 
370 

Gly Tyr Pro Gin 



Ser Gin Arg Glu 

405 

Lys Ala Arg Lys 
420 

His His Asp lie 
435 

Asn Ser Gly Leu 
450 

Arg Met Tyr Val 



Thr Gly Asn Arg 

485 

Glu Phe His Val 
500 



310 

Thr Gin Gly Gly 



Val Ser Lys His 

345 

Thr Gin Pro Gly 
360 

Pro Leu Ala Tyr 
375 

Val Phe Tyr Gly 
390 

lie Pro Ala Leu 



Gin Tyr Ala Tyr 

425 

Val Gly Trp Thr 
440 

Ala Ala Leu lie 
455 

Gly Arg Gin Asn 
470 

Ser Glu Pro Val 



Asn Gly Gly Ser 

505 



315 

Gly Tyr Asp Met 
330 

Pro Leu Lys Ser 



Gin Ser Leu Glu 

365 

Ala Phe lie Leu 
380 

Asp Met Tyr Gly 
395 

Lys His Lys lie 
410 

Gly Ala Gin His 



Arg Glu Gly Asp 

445 

Thr Asp Gly Pro 
460 

Ala Gly Glu Thr 
475 

Val lie Asn Ser 
490 

Val Ser lie Tyr 



320 

Arg Lys Leu Leu 
335 

Val Thr Phe Val 
350 

Ser Thr Val Gin 



Thr Arg Glu Ser 

385 

Thr Lys Gly Asp 
400 

Glu Pro lie Leu 
415 

Asp Tyr Phe Asp 
430 

Ser Ser Val Ala 



Gly Gly Ala Lys 

465 

Trp His Asp lie 
480 

Glu Gly Trp Gly 
495 

Val Gin Arg 
510 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Arg Gly Arg Gly Asn Met lie Gin Lys Arg Lys Arg Thr Val Ser 
15 10 15 

Phe Arg Leu Val Leu Met Cys Thr Leu Leu Phe Val Ser Leu Pro lie 

20 25 30 

Thr Lys Thr Ser Ala Val Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp 
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35 

Tyr Thr Pro Asn 
50 

Glu His Leu Ser 
65 

Tyr Lys Gly Leu 



Tyr Asp Leu Gly 

100 

Gly Thr Lys Ser 
115 

Asn Val Gin Val 
130 

Asp Ala Thr Glu 
145 

Asn Gin Glu Thr 



Arg Phe Pro Gly 

180 

Tyr His Phe Asp 
195 

lie Phe Lys Phe 
210 

Ser Glu Asn Gly 
225 

Asp His Pro Asp 



Ala Asn Glu Leu 

260 

lie Lys Phe Ser 
275 

Thr Gly Lys Glu 
290 

Gly Lys Leu Glu 
305 

Phe Asp Val Pro 



Asp Gly Gin His 
55 

Asp lie Gly lie 
70 

Ser Gin Ser Asp 
85 

Glu Phe Gin Gin 



Glu Leu Gin Asp 

120 

Tyr Gly Asp Val 
135 

Asp Val Thr Ala 
150 

Ser Glu Glu Tyr 
165 

Arg Gly Asn Thr 



Gly Ala Asp Trp 

200 

Arg Gly Glu Gly 
215 

Asn Tyr Asp Tyr 
230 

Val Val Ala Glu 
245 

Ser Leu Asp Gly 



Phe Leu Arg Asp 

280 

Met Phe Thr Val 
295 

Asn Tyr Leu Asn 
310 

Leu His Phe Asn 
325 



Trp Lys Arg Leu 

60 

Thr Ala Val Trp 
75 

Asn Gly Tyr Gly 
90 

Lys Gly Thr Val 
105 

Ala lie Gly Ser 



Val Leu Asn His 

140 

Val Glu Val Asn 
155 

Gin lie Lys Ala 
170 

Tyr Ser Asp Phe 
185 

Asp Glu Ser Arg 



Lys Ala Trp Asp 

220 

Leu Met Tyr Ala 
235 

Thr Lys Lys Trp 
250 

Phe Arg lie Asp 
265 

Trp Val Gin Ala 



Ala Glu Tyr Trp 

300 

Lys Thr Ser Phe 
315 

Leu Gin Ala Ala 
330 



Gin Asn Asp Ala 



lie Pro Pro Ala 

80 

Pro Tyr Asp Leu 
95 

Arg Thr Lys Tyr 
110 

Leu His Ser Arg 
125 

Lys Ala Gly Ala 



Pro Ala Asn Arg 

160 

Trp Thr Asp Phe 
175 

Lys Trp His Trp 
190 

Lys lie Ser Arg 
205 

Trp Glu Val Ser 



Asp Val Asp Tyr 

240 

Gly lie Trp Tyr 
255 

Ala Ala Lys His 
270 

Val Arg Gin Ala 
285 

Gin Asn Asn Ala 



Asn Gin Ser Val 

320 

Ser Ser Gin Gly 
335 



Gly Gly Tyr Asp Met Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg 
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340 345 350 

His Pro Glu Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Gin Pro 
355 360 365 

Gly Gin Ser Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala 
370 375 380 

Tyr Ala Phe lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr 
385 390 395 400 

Gly Asp Met Tyr Gly Thr Lys Gly Thr Ser Pro Lys Glu lie Pro Ser 

405 410 415 

Leu Lys Asp Asn lie Glu Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala 

420 425 430 

Tyr Gly Pro Gin His Asp Tyr lie Asp His Pro Asp Val lie Gly Trp 
435 440 445 

Thr Arg Glu Gly Asp Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu 
450 455 460 

lie Thr Asp Gly Pro Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys 
465 470 475 480 

Asn Ala Gly Glu Thr Trp Tyr Asp lie Thr Gly Asn Arg Ser Asp Thr 

485 490 495 

Val Lys lie Gly Ser Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly 

500 505 510 

Ser Val Ser lie Tyr Val Gin Lys 
515 520 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Leu Thr Phe His Arg lie lie Arg Lys Gly Trp Met Phe Leu Leu 
15 10 15 

Ala Phe Leu Leu Thr Ala Ser Leu Phe Cys Pro Thr Gly Arg His Ala 

20 25 30 

Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp 
35 40 45 



Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 



WO 99/09183 PCT/US98/16906 

— 11 — 



50 

Asn Asn Leu Ser 
65 

Tyr Lys Gly Thr 



Tyr Asp Leu Gly 

100 

Gly Thr Lys Ala 
115 

Gly Met Gin Val 
130 

Asp Gly Thr Glu 
145 

Asn Gin Glu lie 



Asp Phe Pro Gly 

180 

Tyr His Phe Asp 
195 

lie Tyr Lys Phe 
210 

Thr Glu Asn Gly 
225 

Asp His Pro Glu 



Val Asn Thr Thr 

260 

lie Lys Phe Ser 
275 

Thr Gly Lys Pro 
290 

Asn Lys Leu His 
305 

Phe Asp Ala Pro 



Gly Ala Phe Asp 

340 



Ser Leu Gly lie 
70 

Ser Arg Ser Asp 
85 

Glu Phe Asn Gin 



Gin Tyr Leu Gin 

120 

Tyr Ala Asp Val 
135 

Trp Val Asp Ala 
150 

Ser Gly Thr Tyr 
165 

Arg Gly Asn Thr 



Gly Val Asp Trp 

200 

Arg Gly lie Gly 
215 

Asn Tyr Asp Tyr 
230 

Val Val Thr Glu 
245 

Asn lie Asp Gly 



Phe Phe Pro Asp 

280 

Leu Phe Thr Val 
295 

Asn Tyr lie Thr 
310 

Leu His Asn Lys 
325 

Met Arg Thr Leu 



Thr Ala Leu Ser 
75 

Val Gly Tyr Gly 
90 

Lys Gly Thr Val 
105 

Ala lie Gin Ala 



Val Phe Asp His 

140 

Val Glu Val Asn 
155 

Gin lie Gin Ala 
170 

Tyr Ser Ser Phe 
185 

Asp Glu Ser Arg 



Lys Ala Trp Asp 

220 

Leu Met Tyr Ala 
235 

Leu Lys Asn Trp 
250 

Phe Arg Leu Asp 
265 

Trp Leu Ser Tyr 



Gly Glu Tyr Trp 

300 

Lys Thr Asn Gly 
315 

Phe Tyr Thr Ala 
330 

Met Thr Asn Thr 
345 



Leu Pro Pro Ala 

80 

Val Tyr Asp Leu 
95 

Arg Thr Lys Tyr 
110 

Ala His Ala Ala 
125 

Lys Gly Gly Ala 



Pro Ser Asp Arg 

160 

Trp Thr Lys Phe 
175 

Lys Trp Arg Trp 
190 

Lys Leu Ser Arg 
205 

Trp Glu Val Asp 



Asp Leu Asp Met 

240 

Gly Lys Trp Tyr 
255 

Gly Leu Lys His 
270 

Val Arg Ser Gin 
285 

Ser Tyr Asp lie 



Thr Met Ser Leu 

320 

Ser Lys Ser Gly 
335 

Leu Met Lys Asp 
350 



Gin Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Asn Pro 
355 360 365 
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Ala Lys Arg Cys Ser His Gly Arg Pro Trp Phe Lys Pro Leu Ala Tyr 
370 375 380 



Ala Phe lie Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly 
385 390 395 400 

Asp Tyr Tyr Gly lie Pro Gin Tyr Asn lie Pro Ser Leu Lys Ser Lys 

405 410 415 

lie Asp Pro Leu Leu lie Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin 

420 425 430 

His Asp Tyr Leu Asp His Ser Asp lie lie Gly Trp Thr Arg Glu Gly 
435 440 445 

Val Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu lie Thr Asp Gly 
450 455 460 

Ala Gly Arg Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys 
465 470 475 480 

Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr lie Asn 

485 490 495 

Ser Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val 

500 505 510 

Trp Val Pro Arg Lys Thr Thr Val Ser Thr lie Ala Arg Pro lie Thr 
515 520 525 

Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp His Glu Pro Arg Leu 
530 535 540 

Val Ala Trp Pro 
545 



(2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCAAAACA TAAAAAACCG GCCTTGGCCC CGCCGGTTTT TTATTATTTT TGAGCT 5 6 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAAAAATAAT AAAAAACCGG CGGGGCCAAG GCCGGTTTTT TATGTTTT 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCCATTAAGA TTGGCCGCCT GGG CCGAC AT GTTGCTGG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAGCCGACA ATGTCATGGT CGTCGAAATA ATC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCTGATGTCG CAACAGAAAT TAAGAGATGG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
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( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GTCGCCTTCC CTTGCCCAGC CGACAATGTC 
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